1 

Capacity Scaling of Wireless Ad Hoc Networks: 

Shannon Meets Maxwell 

Si-Hyeon Lee, Student Member, IEEE, and Sae- Young Chung, Senior Member, IEEE 



Abstract — In this paper, we characterize the information- 
theoretic capacity scaling of wireless ad hoc networks with n 
randomly distributed nodes. By using an exact channel model 
from Maxwell's equations, we successfully resolve the conflict in 
the literature between the linear capacity scaling by Ozgiir et 
al. and the degrees of freedom limit given as the ratio of the 
network diameter and the wavelength A by Franceschetti et al. 
In dense networks where the network area is fixed, the capacity 
scaling is given as the minimum of n and the degrees of freedom 
limit A to within an arbitrarily small exponent. In extended 
networks where the network area is linear in n, the capacity 
scaling is given as the minimum of n and the degrees of freedom 
limit ^nA 1 to within an arbitrarily small exponent. Hence, we 
recover the linear capacity scaling by Ozgiir et al. if A = 0(n _1 ) 
in dense networks and if A = 0(n~ 1,/2 ) in extended networks. 
Otherwise, the capacity scaling is given as the degrees of freedom 
limit characterized by Franceschetti et al. For achievability, a 
modified hierarchical cooperation is proposed based on a lower 
bound on the capacity of multiple-input multiple-output channel 
between two node clusters using our channel model. 

Index Terms — Capacity scaling, channel correlation, coopera- 
tive multiple-input multiple-output (MIMO), degrees of freedom, 
hierarchical cooperation, Maxwell's equations, physical limit, 
wireless ad hoc networks. 



I. Introduction 

PIONEERED by Gupta and Kumar in HI, the capacity 
scaling in wireless ad hoc networks has been actively 
studied over the last decade. In this research, we consider n 
uniformly and independently distributed nodes in a unit area 
(dense network) or an area of n (extended network), each of 
which wanting to communicate to a random destination at 
the same rate of R(n). The goal is to find out the maximally 
achievable scaling of the aggregate throughput T(n) = nR(n) 
with n. In their seminal paper JT), Gupta and Kumar showed 
that throughput scaling higher than 0(y/n) cannot be achieved 
if each node treats interference as noise and that the multihop 
scheme can achieve 9(y / n/logn)Q This gap was closed 
in 0, where it was shown that the multihop via percolation 

This work was partially supported by the Defense Acquisition Program 
Administration and the Agency for Defense Development under the contract 
UD060048AD. The material in this paper was presented in part at the IEEE 
International Symposium on Information Theory, Toronto, Canada, July 2008 
and at the IEEE International Symposium on Information Theoiy, Austin, 
USA, June 2010. 

S.-H. Lee and S.-Y. Chung are with the Department of Electrical Engineer- 
ing, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 
South Korea (e-mail: sihyeon@kaist.ac.kr; sychung@ee.kaist.ac.kr). 

'in this paper, we use the following asymptotic notations (2): (i) /("■) = 
0(g(n)) if f(n) < kg(n) as n tends to infinity for some constant k. (ii) 
/(n) = 0(g(n)) if kig(n) < f(n) < k2g(n) as n tends to infinity for 
some constants k± and k^. (iii) f(n) = Q(g(n)) if f(n) > kg(n) as n 
tends to infinity for some constant k. 



theory can achieve 0(^/ri). To information theorists, a natural 
question is what the information-theoretic capacity scaling is 
without such underlying physical-layer assumptions. 

The information-theoretic capacity scaling is highly depen- 
dent on the channel model. Furthermore, it is important to use 
a realistic channel model to get results that are closer to reality. 
In wireless networks in line-of-sight (LOS) environments, 
where the spatial locations of nodes are fixed with sufficiently 
large inter-node separation compared to the wavelength, the 
baseband-equivalent channel response between two nodes k 
and i is given as 

— ex P (-j-d ik ) (1) 

from Maxwell's equations where j = y— T, dik is the distance 
between nodes k and i, A denotes the wavelength 4- where 
c is the speed of light and f c is the carrier frequency, and 
G = j^t by Friis' formula where G/ is the product of the 
transmit and receive antenna gains. 

Recently, Ozgiir et al. characterized the information- 
theoretic capacity scaling in (4). Instead of using the exact 
channel model (Q]i with a distance dependent phase, however, 
they assumed that the baseband-equivalent channel response 
between two nodes k and i is given as 

—^-exp(j6 ik ) (2) 

Uik 

where 8^ is independent and identically distributed (i.i.d.). 
For this channel model, the capacity scaling is shown to 
be arbitrarily close to linear in both dense and extended 
networks, which means that each source can communicate to 
its destination as if there were no interference. A key com- 
ponent to achieve such a scaling is the cooperative multiple- 
input multiple-output (MIMO) transmission between two node 
clusters whose sizes are comparable to that of the network. 
If the penalty to form such a virtual MIMO is negligible, 
the classical MIMO results |5j, (6) under the i.i.d. channel 
phase assumption make the linear throughput scaling possible. 
Such an overhead is indeed shown to be arbitrarily small by 
using hierarchical cooperation (HC). In the HC scheme, each 
cluster forms a virtual antenna array using MIMO transmis- 
sions between small scale clusters inside it. Similarly, each 
small scale cluster forms a virtual antenna array by MIMO 
transmissions between even smaller clusters inside it. This 
builds up a hierarchy and a plain time division multiple access 
(TDMA) is performed at the bottom hierarchy. 

The i.i.d. phase assumption in (0) makes the throughput 
analysis easier in 0), but such an artificial assumption can 



2 



lead to results contradicting the physics. Recently, the linear 
capacity scaling in turned out to be contradictory to 
the physical limit on degrees of freedom (DoF) when A is 
not sufficiently small. In Q, Franceschetti et al. showed, 
using Maxwell's equations, that DoF in extended networks 
is limited by the ratio of the network diameter y/n and 
A. By rescaling the network size, the DoF limit becomes 
A -1 in dense networks. This is a fundamental limitation 
independent of power attenuation and fading models. Hence, 
the linear capacity scaling in (4) is in fact not attainable for 
A = and A = 0(n _1 / 2 ) in dense and extended 

networks, respectively. The cause of such a conflict is the 
i.i.d. channel phase assumption in [4] that ignores the channel 
correlation due to the distance dependent channel phase. 

Two contradictory results fl4], J7) highlight the importance 
of exact channel models based on Maxwell's equations. 
Thus, the ultimate goal would be the characterization of 
the information-theoretic capacity scaling of wireless ad hoc 
networks from Maxwell's equations without any artificial 
assumptions. In this paper, we accomplish this goal by charac- 
terizing the information-theoretic capacity scaling of wireless 
ad hoc networks using an exact channel model from Maxwell's 
equations in LOS environments. In dense networks, we es- 
tablish the capacity scaling given as min{n, A -1 } to within 
an arbitrarily small exponent. Hence, the capacity scaling is 
linear in n if A = 0(nT l ). Otherwise, the capacity scaling 
is given as the DoF limit A -1 characterized by Franceschetti 
et al. In extended networks, the capacity scaling is given as 
min{n, ^} to within an arbitrarily small exponent. Hence, 
the capacity scaling is linear in n if A = 0(n -1 / 2 ) and is 
given as the DoF limit characterized by Franceschetti et 
al. otherwise. 

Since the converse is straightforward from the previous 
works in j4|, Q, our main contribution is to show the 
achievability. We note that under the far-field assumption, i.e., 
A is much smaller than the inter-node separation yf^, where 

\/A denotes the network diameter, the DoF limit is in 
general higher than the throughput scaling s/n of the multihop 
via percolation theory of Q. For achievability, we modify 
the HC scheme in |4j according to an achievable MIMO 
rate between two node clusters. We show that the capacity 
of the MIMO channel between two node clusters is at least 
proportional to the minimum of the number of nodes in the 
cluster and the product of the ratio of the cluster diameter and 
A and the angular spread between clusters. In our modified 
HC scheme, only a subset of nodes in a cluster performs 
the MIMO transmission such that the number of participating 
nodes is proportional to the achievable MIMO rate, whereas 
all nodes in the cluster participate in the MIMO transmission 
in the HC scheme of g). 

The organization of this paper is as follows. In Section [TT] 
the system model is presented. In Section [Till we present the 
main theorems on the capacity scaling and their implications. 
In Section llVl a modified HC scheme is constructed according 
to an achievable MIMO rate between node clusters. We 
conclude this paper in Section [V] 

The following notations will be used in the paper. CAf(0, K) 



denotes the circularly symmetric complex Gaussian random 
vector with zero mean and covariance matrix of K. R and N 
denote the set of real numbers and the set of natural numbers, 
respectively. E[ ] and (■)* denote the expectation and conjugate 
transpose, respectively. (•)„, denotes the modulo-m operation. 
(x) + denotes the positive part of x, i.e., 



(*)" 



x if x > 
if x < 



For two integers u and v such that u < v, [u : v] denotes the 
set {u, u+1, . . . , v}. For a set S, \S\ denotes the cardinality of 
the set. The logarithm function log is base 2 unless otherwise 
specified. 

II. System model 

There are n uniformly and independently distributed nodes 
in a square of unit area (called a dense network) or a square 
of area n (called an extended network). It is assumed that the 
node locations are fixed for the duration of the communication. 
Each node has an average transmit power constraint of P and 
the network is allocated a total bandwidth B around the carrier 
frequency f c ^> B. The wavelength A = j- is assumed to be 
much smaller than the average separation distance between 
neighbor nodes given as 0(n~ 1 / 2 ) and 0(1) for dense and 
extended networks, respectively. Furthermore, we assume a 
very mild lower bound on A such that A > nT^ for an 
arbitrarily large constant /.t > 1/2. We assume that A is a 
monotonically non-increasing function of n. This corresponds 
to using higher carrier frequencies to handle more traffic due 
to the increased number of nodes. Every node is a source and a 
destination simultaneously, and the n source-destination pairs 
are determined randomly. Every source wants to communicate 
to its destination at the same rate of R(n, A). The aggregate 
throughput T(n, A) of the network is given as nR(n, A). 

We consider the LOS environment, i.e., no multi-path 
fadingH From Maxwell's equations in far-fields, the discrete- 
time baseband-equivalent channel gain between nodes k and 
i at time m is given as 



H ik [m] = 



G ( 2 % r 1 
— — r exp -j—d ik [m\ 

dik \m\ V A 



(3) 



where j = y— 1, dm [mj is the distance between nodes k and 
i at time m, and G = ttt^t by Friis' formula, where Gi is the 
product of the transmit and receive antenna gainslj Note that 
if Gi is fixed, G vanishes as A tends to zero. In extended 
networks, however, we assume that G is a constant since 
we can increase Gi proportional to A~ 2 without increasing 
the physical size of the antennas beyond a small fraction of 
the inter-node separation^ In dense networks, it is proper to 

2 Our analysis can be extended to cases where there is multi-path fading. 
However, we believe that having a finite number of paths would not affect 
the throughput scaling laws. 

3 A channel model with a path-loss exponent larger than two is considered 
in Appendix |Pl 

4 For each node, we can deploy ©(A" 1 ) antennas vertically that form an 
antenna array of length 0(1), which gives a vertical beamforming gain of 
0(A ). Hence, the product of the transmit and receive beamforming gains 
can be 0(A~ 2 ). 
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assume that the node size is upper-bounded by fcn" 1 / 2 for 
some constant k since the network area is now fixed. Hence, 
G is assumed to be for dense networks because we 

can make Gi proportional to A~ 2 n _1 @ 

The discrete-time baseband-equivalent output Yj, [m] at node 
i at time m is given as 

n 

YiH = ]T H lk [m]X k [m] + Z 2 [m] 
fc=i 

where Xfc[m] is the discrete-time baseband-equivalent input 
at node k at time m and Z;[to] is the additive Gaussian noise 
£/V(0, 1) at node i at time m. The channel state information 
(CSI) is available only at the receivers. From now on, we will 
omit the time index for notational convenience. 

III. Main result 

We first present a lower and an upper bound on the capacity 
scaling for dense networks in Theorems Q] and [2] respectively. 
In Theorems [3] and [4] we present a lower and an upper bound 
on the capacity scaling for extended networks, respectively]^ 

Theorem 1: Consider a network of n nodes on a unit area, 
in which n source-destination pairs are assigned arbitrarily. 
For any e > 0, a scheme exists that achieves an aggregate 
throughput 

T(n, A) > A^minlA- 1 ,™} 1 ^ 

with high probability^] where K\ f is a positive constant 
independent of both n and A. 

The aggregate throughput scaling in Theorem Q] can be 
achieved by the modified HC scheme constructed in Sec- 
tion |IV] Note that Theorem Q] holds even if source-destination 
pairing is arbitrary. 

In the following theorem, we show an upper bound on 
the throughput scaling. If the source-destination pairs can be 
determined according to the node locations, then an aggregate 
throughput scaling of 0(n) would be achievable for any A 
by letting each of the source-destination pairs be nearest 
neighbors. Therefore, for the upper bound on the capacity 
scaling, we limit our interest to random source-destination 
pairing. 

Theorem 2: Consider a network of n nodes on a unit area, 
in which n source-destination pairs are assigned randomly. 
The aggregate throughput in the network is upper-bounded as 

T(n, A) < A 2 mm{A _1 (logA~ 2 ) 2 ,rilogn} (4) 

with high probability, where is a positive constant inde- 
pendent of both n and A. 

5 In dense networks, we can vertically deploy ©(A^n -1 / 2 ) antennas for 
each node that form an antenna array of length ©(n -1 / 2 ). 

6 We note that a similar result was also independently shown in (5] based on 
the same channel model as in (9) at the same time this paper was submitted. In 
this paper, we derive a lower bound on the MIMO transmission between two 
node clusters without any artificial assumptions, which is the key ingredient 
in the achievability, whereas the work in (§3 assumed that interfering signals 
from other transmitting nodes in the network to the MIMO transmission are 
independent. In addition, the effect of A on G is considered in this paper, but 
not in (8). 

7 With probability approaching 1 as n tends to infinity. 



The first term in the minimum in (O is the DoF limit shown 
in Qll The second term in the minimum in dU is obtained 
from the fact that the transmission rate from a source to its 
destination is upper-bounded by the capacity of the single- 
input multiple-output (SIMO) channel between the source and 
the remaining nodes in the network (see, e.g., Theorem 3.1 
in 0). 

Theorems Q] and [2] establish the capacity scaling in dense 
networks to within an arbitrarily small exponent. To see the 
effect of A on the capacity scaling, let A = for (3 > i. 
Note that the condition f3 > i is needed for the far-field 
approximation to hold. If /3 > 1, the capacity scaling is 
arbitrarily close to linear. If | < /3 < 1, the capacity scaling 
is given as the DoF limit. 

Now, we give an achievable aggregate throughput scaling 
in extended networks. 

Theorem 3: Consider a network of n nodes on an area n, 
in which n source-destination pairs are assigned arbitrarily. 
For any e > 0, a scheme exists that achieves an aggregate 
throughput 

T(n, A) > i^minlV^A- 1 ,™} 1 ^ 

with high probability, where K^ t is a positive constant inde- 
pendent of both n and A. 

The aggregate throughput scaling in Theorem [3] can be 
achieved by the modified HC scheme in Section IPVl 

For random source-destination pairing, the following the- 
orem shows an upper bound on the capacity scaling whose 
exponent is arbitrarily close to that of the lower bound in 
Theorem [3] 

Theorem 4: Consider a network of n nodes on an area n, in 
which n source-destination pairs are assigned randomly. The 
aggregate throughput in the network is upper-bounded as 

T(n, A) < K 4 min {VnX^ 1 (\og(n\- 2 ) f ,nlogn} (5) 

with high probability, where K& is a positive constant inde- 
pendent of both n and A. 

The first term in the minimum in (f5]) is the DoF limit shown 
in Q, and the second term in the minimum in (O is obtained 
similarly as the derivation of the second term in the minimum 
in ©. 

Similarly as in dense networks, let A = n~P for (3 > to 
see how A affects the capacity scaling in extended networks. 
Note that (3 — means that A is a constant, regardless of n. 
If P > \, the capacity scaling is arbitrarily close to linear. If 
< /3 < 5, the capacity scaling is given as the DoF limit. 

Remark 1: The exponent (3 signifies the increase of f c to 
handle more traffic as n increases. For example, consider a 
network with an area of 0.01km 2 with n = 100 and f c = 
300MHz (A = lm). Then, the DoF limit is an order of 100, 
and hence, the network is not DoF limited. Now, assume that 
the network size grows to an area of 1km 2 with n = 10000. 
If /3 = 0, i.e., the carrier frequency remains the same, then 
the network becomes DoF limited since the DoF limit is an 

8 In (7], the DoF limit in extended networks is studied and it is shown to 
be determined by the ratio of the network diameter and the wavelength. In 
dense networks, the DoF limit can be obtained by rescaling the network size. 
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Fig. 1. Cooperative MIMO between Ct and Cr with side length D and 
distance L > 2D between the centers 



order of 1000. Now, if /3 = 1, i.e., the carrier frequency is 
increased to 3GHz (A = 0.1m), then the network is not DoF 
limited since the DoF limit is now an order of 10000. 

IV. Modified Hierarchical Cooperation 

In this section, Theorems Q] and [3] are proved by constructing 
a modified HC scheme. Let us first consider a cooperative 
MIMO between two node clusters, which is the key to 
the construction of the modified HC scheme. Consider N 
independently and uniformly distributed nodes in each of two 
horizontally aligned square areas with side length D and 
distance L > 2D between the centers, as shown in Fig. Q] Let 
Ct and Cr denote the left and right clusters of N nodes in 
Fig. Q] respectively. The A-by-A cooperative MIMO channel 
from Ct to Cr is given as 



Y = HX +W+Z 



(6) 



where Y is the JV-by-1 received vector at Cr, H is the N- 
by-A channel matrix from (O, X is the A-by-l transmitted 
vector from Ct, W is the iV-by-1 external interference vector 
with covariance matrix E, and Z is the A-by-1 additive 
Gaussian noise vector CAf(Q, /). Let p\ = j^rp tr(E) and 
P2 = N(Qpyi tr(E 2 ). The following theorem presents an 
achievable MIMO rate from C T to Cfl0E3 The proof is in 
Appendix [A] 

Theorem 5: The capacity C(H) of the cooperative MIMO 
channel from Ct to Cr is lower-bounded as 

8 2 { Pl +K' l Nf 



C{H) > N 



, 1/2 



X log 1 



{K 2 max{^ 2 , 7V 3 M- 1 }) 1 /2)2 
((l-6)K[N-6 Pl )\ 



QP, 



1 



for any < 5 < 1 with high probability as TV tends to infinity, 
where K[ and K' 2 are positive constants independent of D, 
L, A, and A and M is given as 



M = max < 1, 



XL 



(7) 



9 A more general version of Theorem \5\ was shown previously in Theorem 
1 in (5), where multiple antennas per node were assumed. M in Theorem [5] 
corresponds to the fourth term in the minimum of (4) in (9), which was 
obtained based on an approximation and, therefore, differs slightly from M. 
In deriving Theorem [5] however, no approximation is used, and therefore, the 
result is now exact. 

10 For the simplicity of presentation, Ct and Cr are assumed to be 
horizontally aligned. However, the proof of Theorem [5] in Appendix fA] can 
be easily extended to cases where Ct and Cn are not horizontally aligned, 
which will result in the same conclusion as in Theorem \5\ 



We have the following corollaries for certain classes of E. 

Corollary 1: If there is no external interference, i.e., W = 
0, the capacity C(H) of the cooperative MIMO channel from 
Ct to Cr is lower-bounded as 

(CP 
1 + K ijT N 

with high probability as N tends to infinity, where K' 3 and 
K' 4 are positive constants independent of D, L, N, and A. 
Proof: We choose 6 = 0(1) in Theorem [5] e.g., S = h. 



Corollary 2: If p\ = O(sN) and p2 = 
0(s u maxf^ 2 , A^M^ 1 }), where s = 0(1) and v > 1, the 
capacity C{H) of the cooperative MIMO channel from Ct 
to Cr, is lower-bounded as 

' NGP i 

1 



C(H)> 



Kl 



min{ N, M} log 



1 



NGP . 
L 2 ' 



with high probability as N tends to infinity, where K' 5 and 
K' 6 are positive constants independent of D, L, N, and A. 
Proof: We choose S = 0(s~ 2 ) in Theorem[5] ■ 

Note that M matches the DoF limit predicted in Q, EH) 
given as the product of the normalized cluster diameter -j and 
the angular spread y- between the clusters. 

In the following subsections, Theorems [T] and [3] for dense 
and extended networks, respectively, are proved by construct- 
ing a modified HC scheme. 



A. Dense network 

Let us construct the modified HC scheme for dense net- 
works consisting of h hierarchy levels. For an area of A', 
there are an order of A'n nodes with high probability^ For 
simplicity, we assume that there are exactly A'n nodes in our 
description of the scheme, but our results hold without such an 
assumption. Consider a (h + l)-tuple (no, Hi, n/j) G N h+1 
such that rih = n and rik-i < rife for all k £ [1 : h] and 
a ft-tuple (mo,mi,...,m/,_i) <G N h such that < rife for 
all k S [0 : h - 1]. For k g [0 : h], let A k = ^ and 
Lk = \/Ak- Consider a hierarchical structure of the network 
such that the network is divided into square areas of Ah-i, 
each of those square areas is again divided into smaller square 
areas of A^-2, and so on, i.e., at the fc-th hierarchy level for 
k G [1 : h], each square area of A^ is divided into smaller 
square areas of Ak-i- 

Let Xfc (rife , A) for k g [0 : h] denote the achievable 
throughput when a cluster of rife nodes operates as a network 
having its own rife source-destination pairs in an arbitrary 
manner. The following lemma gives Tfc(rife,A) as a function 
of Tfe-xK-i.A) forfcG [l:h]E 

Lemma 1: Fix k G [1 : h]. Consider a cluster of rife nodes. 
If, for any two clusters u and v of n^-i nodes inside the cluster 
of nfc nodes, a rate of Rk is achievable with high probability 
for the MIMO communication from mfc_i randomly chosen 

1 1 See Lemma 4. 1 in H for the proof. 

12 Since Xjj(nfc, A) for k 6 [1 : h] has a recursive form, it also depends on 
n , ...,7ifc_i,mo, ...,m fe -i. 
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Fig. 2. The big square and small squares represent a cluster of nodes 
and the clusters of n k _i nodes inside it, respectively. In Phases 1 and 3, 
the clusters of n k _i nodes operate in parallel according to the following 9- 
TDMA scheme: the total time of the phase is divided into 9 TDMA slots, 
and, in the i-th TDMA slot for i £ [1 : 9], clusters marked with i operate 
simultaneously while the other clusters are silent. 



nodes in cluster u to m^-i randomly chosen nodes in cluster 
v when other nodes in the cluster of nodes are silent, we 
have 



T k {nk,X) > 



K'i 



n k m k -i 



1 + m k -i/Rk mk-mk-i/Tk-i(nk-i, A) + n k 

(8) 



with high probability, where K" is a positive constant inde- 
pendent of both n and A. 

Proof: We construct a scheme for the cluster of n k nodes 
when it operates as a network having its own n k source- 
destination pairs in an arbitrary manner. From now on, a 
cluster indicates a cluster of n k ^\ nodes inside the cluster 
of n k nodes unless otherwise specified. We randomly assign 
the indices [1 : nk-i] to n k -i nodes in each cluster and let 
A u ,v for u € [1 : n " fc ] and v <G [1 : rik-i] denote the set 
{(v + i)n k _ 1 + 1|1 < i < Trik-i} of nodes in cluster u. 

The scheme consists of three phases. Let us first explain 
the scheme briefly from the perspective of source s in cluster 
u and its destination d in cluster v. In the first phase, source 
s in cluster u distributes its message to A u> s- In the second 
phase, Au.s performs MIMO transmission to A v ,d- In the last 
phase, destination d in cluster v collects quantized MIMO 
observations from A v .d and decodes the message. The detailed 
operation in each phase is as follows. 

• Phase 1: Each cluster operates in parallel according to 
the 9-TDMA scheme of [4] illustrated in Fig. |2] Source 
s in cluster u distributes its message to A u ,s, i-e-, the 
message of s is split into m k -i subblocks and each 
node in A u . s receives one subblock. For a cluster, this 
can be done by setting up m k -i subphases, where n k -i 
source-destination pairs in each of the subphases are 
assigned as follows: in subphase i G [1 : m k -i], {(s, (s + 
*)"fc-i + l)l s G [1 : n fe-i]} lS the set of n k -i source- 
destination pairs. Because Tk-i(n k -i, A) is achievable 
for a network of n k ^i nodes, n k -i/T k -i(n k ^i, A) time 
slots are needed for each subphase. Since there are m k ^\ 
subphases in each TDMA slot, Phase 1 needs a total of 
9m fc _infe_i/T fc _i(n fe _i, A) time slots. 

• Phase 2: We perform successive MIMO transmissions 
for all source-destination pairs, i.e., MIMO transmission 



from Au.s to A v .d for source s in cluster u and desti- 
nation d in cluster v. Since a rate of R k is assumed to 
be achievable for each MIMO transmission, m k _i/R k 
time slots are needed for each source-destination pair. 
Since we have n k source-destination pairs, a total of 
n k m k -i/R k time slots are needed for Phase 2. After 
Phase 2, each node quantizes the MIMO observations at 
a fixed rate Q subblocks per time slotf^l 
• Phase 3: Each cluster operates in parallel according to 
the 9-TDMA scheme of 0] depicted in Fig. |2] Des- 
tination d in cluster v collects the quantized observa- 
tions of the MIMO transmission intended for it from 
A v ,d and then decodes the message. Note that each 
quantized MIMO observation consists of Qmk-i/ Rk 
subblocks. By setting up m k -i subphases for a cluster 
similarly as in Phase 1, where n k -i source-destination 
pairs are assigned in each of the subphases, a total 
of (9QTO^_ 1 n I; _i)/(i?/ c rA : -i(n I; -i, A)) time slots are 
needed for Phase 3. 
In total, 

Qmk-in k -i/Tk-i{n k -i, A) + n fe m fc _i/i? fc 
+(9Qml_ 1 n k - 1 )/(R k T k -i(n k -i, A)) 

time slots are needed to transport n k messages, i.e., n k m k ^i 
subblocks. Hence, the constructed scheme yields an aggregate 
throughput of (0, which proves Lemma Q] 

In the above explanation of the scheme, we focused on the 
modified operation from the scheme of H and the resulting 
scaling law of the throughput. The readers should refer to |4| 
for a more detailed description of the scheme. However, taking 
those details into account does not change the throughput 
scaling. ■ 

The modified HC scheme is constructed recursively using 
the scheme in the proof of Lemma Q] for the original network 
of n nodes and using the multihop via percolation theory Q 
for clusters of no nodes at the bottom hierarchy. Now, let 
us show an achievable throughput scaling using the modified 
HC scheme with h hierarchy levels. Note that throughput 
achieved by the modified HC scheme depends on the choice 
of (no, ni, 7i/j) and (mo,mi,...,m/,_i). First, we choose 
TOfc_i as Gfc for fee [1 : h], where 



G k 



mm < n k -x, 



n k -\ 



For the 

(m , mi 
of R k = 



(n k n) 2 A log A 

modified HC scheme with the above choice of 
...,m/j_i), the following lemma shows that a rate 
0(Gfc/(logn) 7 ) is achievable for the MIMO trans- 
missions in Phase 2 at the fc-th hierarchy level for k £ [1 : h]. 

Lemma 2: In Phase 2 at the fc-th hierarchy level of the 
modified HC scheme for k G [1 : h], a rate of Rk = 
0(Gfc/(logn) 7 ) is achievable for the MIMO transmissions 
between clusters of G k nodes. 

Proof: Fix k G [1 : ft.]. In Phase 2 at the fc-th hierarchy 
level, we let each transmitting cluster of Gk nodes use a 

13 From Appendix II in (4), a strategy exists for each node to encode the 
observation of a MIMO transmission at a fixed rate Q such that the resultant 
mfc—i-by-mfc-i quantized MIMO channel has the same multiplexing gain 
as the original MIMO channel. 
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T k (n k , X) 



9mfc_infc_i/T fc _i(nfc_i, A) + n k m k -x/R k + (9QTO;_ 1 ?i fc _i)/(i? fc r fc _i(?i fc _i, A)) 

> — 5Li (9) 

9(Q + 1)(1 + m k -i/R k ) m k -in k -i/T k -i(n k -i,\) + n k 
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Fig. 3. For cluster V, clusters that operate simultaneously with V according 
to the 9-TDMA scheme are represented as shaded. The set of shaded clusters 
with dots represents Uy(l), and the set of shaded clusters with slash lines 
represents TJy{2). 



randomly generated Gaussian code according to CAf(0, P'l), 
where 



pi _ L t p _ n k p 



GGi 



(10) 



This satisfies the average power constraint of P per node 
because each node participates in the MIMO transmission for 
— fraction of time in Phase 2. 

Consider the MIMO transmission from cluster Ct of G k 
nodes to cluster Cr of G k nodes inside cluster V of n k nodes 
in Phase 2 at the fc-th hierarchy level of the modified HC 
scheme. To prove that the capacity of the MIMO channel 
from Ct to Cr is at least linear in Gfc/(logn) 7 , we use 
Corollary [2] By adopting the notations for Corollary [2] let D 
and L denote the side length of Ct and Cr and the distance 
between the centers, respectively, and let M be given as ||7). 
The MIMO transmission from Ct to Cr is interfered by 
the MIMO transmission by G k nodes in each cluster of n k 
nodes that operates simultaneously with V. Let Uy denote 
the set of clusters of n k nodes that operate simultaneously 
with V. Then, Uy can be split into subgroups according to 
their distance to V such that the i-th subgroup Uy(i) contains 
8i or less clusters of n k nodes and the distance between the 
centers of V and each cluster in Uy(i) is greater than or 
equal to (3i)L k for i = 1,2,..., as illustrated in Fig. [3] 
The number of such subgroups can be simply bounded by 
n/n k . Let denote the number of clusters of n k nodes 

in Uy(i). Then, the MIMO channel from Ct to Cr is given 
as ©, in which G k is substituted for N and the interference 
W is given as W = YJl=i HiX h where H t is the G fc -by- 
(\Uv(i)\G k ) channel matrix from Uy(i) to Cr and Xi is the 
(\Uy(i)\G k )-by-\ transmitted vector from Uy{i). 

Now, let us show that the covariance matrix E of W satisfies 
the conditions in Corollary [2] for s = \ogn and v = 2. Let 



Fi = £$£±Hi. Then, we have 



L 2 



Pi 



G k GP 

1 n/n„ 



7 tr(S) 



L 2 



, n/n k 

i— l v 7 



and 



92 



G k {GP') 2 tr(S2) 



L 4 



G k G 2 



tr 



i/n k 



G k G 2 

-(- 

G k \L k 



4 / n/n 



where (a) is from the following lemma, which is a direct 
consequence of the matrix trace inequality in ifTTl . 

Lemma 3: If Aj's are positive semidefinite matrices, then 
tr(Ei^) 2 < (Ei*r 1/2 (Af)) 2 . 

By applying similar bounding techniques as those for 
tr(Fif*) and tr (FF*FF*) in Appendix g] to tr(F^*) and 
tr(FiF*FiF*), we can show 



tr(F i F*) = 0(\U v (i)\Gl) 



and 



tr(FF*F t F*) = OdUy^mzxlGlGiM- 1 }) 
with high probability, where A/ is given as 

\ D 2 ( ( D 2 ^ + " 
M = maX ( 1 'A^( 1+ ( 1 ° g AZ; 

Because \U v {i)\ < 8i and (-^ M~ l < M~ x , we have 



G k GP 



7 tr(E) = 0((logn)G k ) 



and 



L 4 



G k (GP>) 



— tr(E^) =0((logn)^max{GtG^/- i }). 
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Hence, the conditions in Corollary [2] are satisfied for s = logn 
and v — 2. From Corollary [2] for s = log n and v = 2, the 
capacity C{H) of the MIMO channel from Cr to Cij is lower- 
bounded as 

CW " Ab "~~^ 6 108 I 1 + ^l + ^logn 



L 2 



(log n) 

W min{G fc ,M} 

" 7 (logn) 7 
for some constant K' 7 with high probability, where (a) is from 
the choice of P' in dTOb . Furthermore, because = 0(A _1 ) 
and £ = n( we have M = fi( ?S=l ). 

\{n k n)^ (n fc n)2AlogA 1 

Hence, we have C(H) = Vt{G k / (\ogn) 7 ), which proves 
Lemma [2] ■ 
Now, by substituting G k and K'^logn) 7 for mfe_i and 
mk-i/Rk in <EK where if^' is a positive constant independent 
of both n and A, we have the recursive form of T k (n k , A) for 
k G [1 : ft] for the modified HC scheme given as 



T k {n k ,\) > 



K" 



n k G k 



(11) 



(logn) 7 n fe _iG ! fe/T fe _x(nfe-i, A) + n fc 

where Jfg is a positive constant independent of both n and A. 

The following lemma gives an achievable throughput scal- 
ing using the modified HC scheme with ft hierarchy levels 
when we choose (no, m, nu) that maximizes (fTTT i for 
fee [1 : ft]. The proof is at the end of the present section. 

Lemma 4: In dense networks, the modified HC scheme with 
ft hierarchy levels achieves 



T h (n,X) > 



(l ogn )7/ l+ i (Alog A~ 1 ) Tb <".^ h > 

with high probability, where Ch is a positive constant inde- 
pendent of both n and A, 

b(n, A, ft) 

'ft + 1 if log„(Alog A -1 ) < — A(ft) 
k if - A(fc) < logjAlogA" 1 ) < -A(fc- 1) 

for some k G [2 : ft] 
1 if -A(l) <log„(AlogA- 1 ) 



and 

5„. = 



u2 h 



Proof of Theorem^ Fix e > e' > 0. Let ft be the smallest 
integer such that ft > 4 an d let n be the smallest integer such 
that (7ft + 1) log„ logn < ^ and log n i/ 2 logn M < jE^r. Let 
us define functions yo(x) and y k {x) for fee [1 : ft + 1] for 



x < — ± as 



y (x) = (1 - e') min{-x, 1}, 

y k {x) = S k - r k x - (7ft + 1) log„ log n. 

Fix fee [1 : ft + 1]. We will show that yfe(x) is larger than 
2/o (^c) f° r a H x < — 5- Let us first show that £/&(— 1) > 1 — 
2/k(-l) is given as 

yfc(-l) = 4 + T fc - (7ft + 1) log n logn 



> S k + T k - - 



If 1 < k < §, we have 



/i-fe 



+ fc — 1 > 3 ( - 



c 
4 



If | < k < ft + 1, we get 



h- k 



1>*>2> ? 



ft 4 
> 2 > ?' 



Thus, we conclude that y k (—l) > 1 — |-. Now we are ready 
to show j/fe(x) > yo(x) for all x < —\. Note that < r k < 1. 
For x < — 1, 



iy fe (:c) = */&(-!) - T fe (a; + 1) > J/fc(-l) > 1 



yo(x). 



For -1 < x < -i, 



J/fc(») = 2/fc(-l) ~ TfcOc + 1) 
>t/ fe (-l)-(x + l) 
> -(l-e')z 
= 2/o (a:). 

Hence, we prove that min{y k (x) : 1 < k < h + 1} > j/o(x) 
for all a? < — |. By letting x = log„(A log A -1 ), we 
equivalently prove that the achievable rate of the modified 
HC scheme with h hierarchy levels in Corollary [3] is lower- 
bounded as 



31+h-u + 2>»-«( u - 1) : 



3 i+fc-u + 2 fc-u( u _i) T h (n,A)> 



r" 



A. 3 fe ~"(3+t;)-2'' 



(log n) 



7/1+1 



- for 



for u e [1 : h + 1], where A(u) = 3 fe-„( 4+ ^)_ 2 i+h- 
« G [1 : /i]. 

The following corollary is obtained straightforwardly from 
Lemma [4] 

Corollary 3: In dense networks, the modified HC scheme 
with h hierarchy levels achieves 



> C' h min 

1 log A -1 



Now, we have 



T h (n, X) > C' h min 



T h (n,\) > 



C> 



(A log A _1 ) Tfc 
, n > 

l-c' 



1< fc < ft, + 1 



1< k < h + 1 



(logn) 7 '^ 1 [(AlogA- 1 )^ 

with high probability, where C' h is a positive constant inde 
pendent of both n and A. 

Now we are ready to prove Theorem Q] 



logA-i' 

C' h min {(A- 1 ) 1 - 10 ^- 1 lo « , n} 
C' h min |(A- 1 ) 1 - lo « n V2 lo s« M ; n J 
> C^minjA- 1 ^} 1 " 6 



(a) 

> 



l-e' 



1-e' 
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where (a) is because n M < A < n L / 2 . Hence, Theorem Q] 
is proved. ■ 

B. Extended network 

In extended networks, both y/G and the distance between 
nodes is increased by a factor of s/n as compared to those 
in dense networks. Hence, for the same transmit power, the 
received power at each node remains the same as in dense 
networks. By rescaling the space, let us consider an extended 
network as an equivalent dense network on a unit area but 
with the wavelength reduced to A?i~ 1//2 . Since the wavelength 
is given as An -1 / 2 in the equivalent dense network, Theorem 
13 is proved. 

Proof of Lemma® First, consider the case of b(n, A, ft) — 
ft + 1. Since A (ft) = 1, this implies A log A -1 < -nT 1 . In this 
case, G k is n k -x, and hence, the recursive form of T k (n k , A) 
in ( fTTb becomes 



Tit (nit, A) > 



K" 



n k -in k 



(logn) 7 n|_ 1 /Tfc_i(n i (._ 1 ,A) + n k 



(12) 



for all k G [1 : ft]. Note that T (n ,A) = ©(^) by using 

the multihop via percolation theory Q] for the cooperation for 

I— , fc+i 
the clusters of uq nodesl 14 ! By choosing n k -i = n k +2 that 

maximizes ST% for k G [1 : ft], 7/i(n, A) > (ij^jVh+T^ 7 ^ 

is obtained. Because <5^+i = and r; l+ i = 0, Lemma [4] is 

proved for the case of b(n, A, ft) = ft + 1. 

Next, consider the case of 6(n, A, ft) = ft' for some ft' <G 

[1 : ft]. Let us first assume that G k is n k -i for k G [1 : ft' — 1] 

and is f° r fc G [ft' : ft]. For the choice of 

(n k n) 2 A log A- 1 

no, JT. 1; n^_! that maximizes ( II lb under this assumption, 
we will show that the range of A where the assumption is valid 
is the same as the range of A corresponding to b(n, A, ft) = ft' 
in Lemma [4] 

Since G k is assumed to be n k -\ for k £ [1 : ft' — 1], we 
obtain 



TV_i(nfc/_i, A) > 



, h'+i 



(logn) 7 (' l '- 1 )+ 1 



For k £ [ft' : ft], G k is assumed to be 



(nfcn^AlogA- 1 

hence, the recursive form of T k (n k , A) in dill ) is given as 



(13) 
and 



T k (n k , A) > 



K" 

^3 



(logn) 7 



n k -in k 



^ 1 /T fc _ 1 (n fc _ 1 ,A)+< /2 ((AlogA- 1 ) 2 n)5 



(14) 



Let us assume that T k (n k , A) for k £ [ft' 

a h' ,fc 



1 : ftl has 



the form of 



c k 



for some positive 



(l ogn) 7 fc + l ( (AlogA -l ) 2„)^/, fc 

constants C k , aw.k, and (3h',k independent of both n and 



A. Then, the recursive formulas ah' fc = 



-i+i 



2(2-a„/ 



and 



14 In (3), a path-loss exponent larger than two is considered and a multihop 
via percolation theory is shown to achieve &(y/n). For the path-loss exponent 
equal to two, however, it achieves 0( ) due to the interference power 
proportional to logn. 



Ph',k = 1 a 2(2-a + , 2l3h 'j k 1 are obtained by choosing Ufe_i 



2: J 



1-2^,, 



n fc _i=?i fc - --^((AlogA" 1 ) 2 ^ 515 ^ 7 ^ 

that maximizes ( TPfl ) for fc G [ft' : ft]. Using the conditions 

ah',h'-i = h^fT and Ph',h'-i = from ([13), av.fc and 
for fc G [ft' : ft] are given as 

z x+k-h' +2 i+ k -h'( h > _ 

a ' l '' fe ~ 3 1 + fc -' l '2 + 2 1 + fe -' l '(ft'-l)' 

£l+k—h' _ 2^+k-h' 

Ph>,k = 3 i+fc-ft' 2 + 2 1 + fe -' l '(ft'-l)' 

Because n./, = n, n k for fc £ [ft' — 1 : ft] is given as ( fT5] >. 

Now, the range of A that makes the assumption, i.e., G k is 
n fc _i for fc G [1 : ft'-l] and is ^ for k £ [ft' : ft], 

(ra fc n)2 A log A- 1 

valid is given as 

f(nn^)~ 1/2 < AlogA" 1 < (nn^-i)" 1 / 2 if ft' g [2 : ft], 
[(nnj)- 1 / 2 < AlogA- 1 if ft' = 1. 

(16) 

By using and n^_i from ( fT3T >, we can show that the range 
of A in (fTol l is the same as the range of A corresponding 
to b(n, A, ft) = ft' in Lemma [4] Hence, we prove that for 
b(n, A, ft) = ft', the modified HC scheme with ft levels 
achieves 



T h (n,\) > 



(l 0gn )7h+i ((AlogA- 1 ) 2 n)' 3 '-'-'- 



(logn) 7 ^ 1 (AlogA- 1 ) 2 ^'.^' 

Since 5 h , = a h > ih - Ph',h and t w = 2^h',h, Lemma g] is 
proved for the case of b(n, A, ft) G [1 : ft]. ■ 

V. Conclusion 

We characterized the information-theoretic capacity scaling 
of wireless ad hoc networks from Maxwell's equations without 
any artificial assumptions. The capacity scaling is given as 
the minimum of the number of nodes and the DoF limit 
given as the ratio of the network diameter and the wavelength. 
Accordingly, a network becomes DoF-limited if A = il(n _1 ) 
in dense networks and A = f2(n -1 / 2 ) in extended networks. 
Our results indicate that the linear throughput scaling in J4l 
that was shown under the i.i.d. channel phase assumption is 
indeed achievable to within an arbitrarily small exponent in the 
non DoF-limited regime. In the DoF-limited regime, the DoF 
limit characterized by Franceschetti et al. in that generally 
has higher scaling than that of the multihop scheme can be 
achieved to within an arbitrarily small exponent by using the 
modified HC scheme. 

We also considered a channel model with a path-loss 
exponent a larger than two. In dense networks, the throughput 
scaling using the modified HC scheme for a > 2 remains 
the same as when a = 2. However, the throughput scaling 
using the modified HC scheme is decreased for a > 2 in 
extended networks due to the power limitation. This suggests, 
as a further work, an upper bound considering both the 
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n k = n 



^ 1 + h — h + 3/1— fe _ 2^ — 'l ( 

3l + 'i-'i'+2' 1 -'i'(-l + h') 



(A log A -1 ) 31+/.- +2 h-h' ( _ 1+h0 



(15) 



DoF limitation due to the channel correlation and the power 
limitation due to the power attenuation over the distance. 

Appendix A 
Proof of Theorem[5] 

The capacity C(H) of the MIMO channel from C T to C R 
is lower-bounded as 

C(H) = max I(X: Y) 

f(x):E[\X^]<P 



(a) 

> I(X G ;Y) 

W , det(J - 

> log- 



E + PHH*) 



log 



niltd 



det(J + E) 

GP , 
T 2 



- Kj 



nf =1 (i 

> log 11 - 

i=l 



I- 
GP 



Ki) 



N 



GP 
GP , 



E[x] 



N~E 



log 



1 



GP 
TP 



EM 

> NPt(k > (1- 5)E[«])log 

C = 5 /VPr( K > (l-<y)E[«]) 

3P ((1-5)E[ 7 ]- 



1 



GP 



(1-5) E[ K ] 



GP 



E[x] 



x log 1 + 



> N „ r L J log 



5E[ X ]) 



G'P 
GP 



em ; 

((1-<5)E[ 7 ]-«5EM) % 



(/) 

> JV 



E[«a] 



1 



GP 



EM 



E[7]) 2 



(E 1/2 [X 2 ] 



x log 1 



GP 

T 2 " 



f-E 1 / 2 ^ 2 ]) 2 
((1-<5)E[ 7 ] 



<5EM) 



1 



GP 



EM 



(17) 



for any < 5 < 1, where X G is CAf(0,PI), n is cho- 
sen uniformly among the eigenvalues Ki,i = 1,...,N of 
^p(E + PHH*), x is chosen uniformly among the eigen- 
values = 1, . . . ,iV of (§75 E, and 7 is chosen uniformly 
among the eigenvalues = 1,...,N of ^-HH*. (a) is 
from choosing the input AT as ATg, (6) is because assuming 
Gaussian interference minimizes the mutual information for 
given noise and interference covariance matrices lfl2l . lfl3ll . 
(c) is because the geometric mean is upper-bounded by the 
arithmetic mean, (d) is because E[k] = EM + E[ 7 ], (e) is 



from the Paley-Zygmund inequality 0J, fl4l . and (/) is from 
Lemma [3] 

Note that E[x] = p\ and E[% 2 ] = ^2- To get a lower bound 
on ( fTTI i, we need a lower bound on E[ 7 ] and an upper bound 
on E[ 7 2 ]. Let F = -j=H. Then, F ik = a ik exp(-j2vr^), 
where = Note that constants a m i n and a max exist 



independent of D and L such that a n 
i, G [1 : N}. First, E[ 7 ] is given as 

1 



for all 



E[ 7 ] 



N 



tr (FF*) 



1 N 

- y 



»,fc=i 



SinCe a min < l^fcP < 2 lax 

Next, E[ 7 2 ] is upper-bounded as 



we have E[ 7 ] = 9 (TV). 



E[ 7 2 



-tr (FF*FF*) 



1 



N 



EZ7 Z?* Z7 77* 

i,j,k,l=l 



1 E 



N 
1 



(ij,fc,;)e*i 

E 



FihF;jFuF, 



< «Lx(2TV 2 - /V) + 



1 



E 

AT 

C (2/V 2 -/V) + - J] Q 



FikF^FjiF^ 



ijkl 



l,j,k,l = l 
i<j,k<l 



where ^1 = {(i, j, k, l)\i,j, k,l £ [1 : N], i = j or k = I}, 
* 2 = {(i,j,k,l)\i,j,k,l £ [1 : N],i + j and k ? I}, and 
Qijki — dikduajkciji cos (^-{dik — du — + <%)). 

Note that Qijkt's for all 1 < i < j < and 1 < 
k < I < N follow an identical distribution, but they 
are not necessarily independent of each other. Nevertheless, 
PTfira Z)*-J- fc . l=1 QijW strongly converges to E[Qi 2 i 2 ] as 
the following lemma shows, where the expectation is over 
uniform node distributions. 

Lemma 5: The sample mean 



JV 2 (AT- 

strongly converges to E[Qi2i2]- That is, 



i<j,k<l 



'ijkl 



Pr lim 



jv->oo n 2 (n — iy 



N 

E 

i<j,k<l 



'ijkl 



E[Qi 



The proof of the above lemma is given in Appendix IB] 
Furthermore, the following lemma gives an upper bound on 
E[Qi2i2], which is proved in Appendix ICl 
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Lemma 6: E[Qm 2 ] = 0(M^ 1 ). 
From Lemmas and [6] we have E[7 2 ] = 
0(max{7V 2 ,7V 3 M -1 }) with high probability as N tends to 
infinity. 

Now, by using the bounds on E[7] and E[7 2 ], Theorem [5] 
is proved. ■ 

Appendix B 
Proof of Lemma[5] 

Let us first present a theorem on the strong convergence of 
the sample mean of a sequence of not necessarily independent 
random variables. The proof is in fl5l . 

Theorem 6: Let {X m }^ =1 be a sequence of not necessarily 
independent complex-valued random variables, each of which 
follows an identical probability density function f(x) such that 
E[X] = and E[|A"| 2 ] and \X\ are bounded. Suppose that 



e- 



K>1 



K 3 



E 



m<K 



< 00. 



(18) 



Then, the strong law of large numbers holds for {X m }^ =1 , 



i.e. 



lim — X m = almost surely. 



Now, let us prove Lemma[5]using Theorem[6] For idgN, let 
U w and V w denote the collections of random variables given 
as 

U w = {Qijki - E[Q m2 ]|l < i < j < to, l<k<l<w} 
and 



V w = 



if to = 1 

U w \ U w -\ otherwise 



Note that \U U 



w 2 {w-l) 2 



= I) 3 , and IX=i^ = 
U w . Let for i G [1 : (to — l) 3 ] be the i-th random variable 
in V w with an arbitrary ordering. We construct a sequence 
{X m }m=i °f random variables as follows: for to G N, we let 
X m denote the random variable , p i=1 where m! is 

the integer satisfying YT=i 1^1 + 1 < m < YhLx \Vi\- 

Let us show that {X m }^ =1 satisfies the conditions in 
Theorem [6] First, it is easy to show that E[|X m | 2 ] and \X m \ 
are bounded, i.e., E[|X m | 2 ] < a max and \X m \ < 2a 4 nax . Next, 
the left-hand side term of the inequality in (fl~8j) is written as 



E- 



K>1 



K 3 



E 



E x « 



m<K 



Consider two random variables X mi = Qi 1 j 1 k 1 i 1 — E[Qi2i2] 



and X r , 



= Qi,. 



j 2 k 2 i 2 



E[Qi2i 2 ] in {X m }% =1 . If 



{h,ji}f){i2,j2} = and {h,h} f){k 2 ,h} = 0, X rni and 
X m2 are independent of each other, and hence, E[X mi X m2 ] = 
0. Otherwise, |E[X mi X m2 ]| = \E[Qi 1 j 1 k 1 i 1 Qi 2 j 2 k 2 h} - 



E[Qi2i2] 2 | < 2a max . Using this, let us get an upper bound 



i E E E[X mi X„ 



K 

nii<K m 2 <K 

for each K G N as follows. 



K 



w- s {w-iy 



for some w € N: In this case, {X m |l < 
to < K} is U w . For each random variable X rni in U w , 
( w ~ 2 ^ ^"'~ 3 - > random variables in U w are independent of 
X mi . Thus, we have 



J^3 E E E l Im i 1 "'^ 

w 2 (w-l) 2 -{w-2) 2 (w-'S) 2 



mi<K m2 < A" 

l " 



< — 7Y(2a* lax ) 



w 2 (w-l) 2 ~(w-2) 2 (w-3) 2 



w 4 (w-l) 4 

< C" n 8 I 

" 1 max («; 2 ( U ;-l) 2 /4)5/4 



C 1 a 



1 



for some positive constant C". 

— L ^ — — < K < — v , ; for some w £ N: Let K\ = 

W 2 (W-1) 2 , T y V?(W + 1) 2 rp, 

— y — — — and 71 2 = — — — — . Then, we get 

^3 E E E [X mi X m2 ] 

mi <if 777 2 < K 

4E E |E[X mi X m2 ]| 



mi <K m2<K 

-i? E E iE[x mi x m2 n 

< ^ 2 (2aL x r 2(W + 1)2 ~ (W ' 1)2(W ' 2)2 
_ A' 3 4 

8 w 2 (w + l) 4 - (ty - 2) 2 (w - l) 2 (w + l) 2 



tf 4 (w; — l) e 



< 



y 2 "max 



(w 2 (w + l) 2 /4) 5 / 4 

1 



- r"'n 8 

< C 9 a 



2 "max -5/4 



w 'o 8 — 
2 "max j-^. 



for some positive constant C 2 '. 



Let C 



. m&x{C" ,0%}- Now we have 



A'>1 



E 



E X " 



m<K 



= E E E E[x TOl x m2 ] 



K 

K>1 mi <Km 2 <K 
- C " E JF5/4' 



K>1 
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Ct 



Cr 



1) 




s 


u 




t 



Fig. 4. Two nodes u and v in Ct and two nodes s and t in Cr. 6 and 
denote Zuvs and Zvsu, respectively. 



which is finite. Hence, from Theorem [6] 
Pr 

which concludes the proof of Lemma [5] 



( Urn ^ V X m = ) = 1, 

\ m<if / 



Appendix C 
Proof of Lemma|6] 

Consider two uniformly and independently distributed nodes 
u and v in Ct and two uniformly and independently dis- 
tributed nodes s and t in Cr. Consider a cartesian coordinate 
system whose origin is at the bottom left corner of Ct- Let 
z u = (x u ,y u ), z v = (x v ,y v ), z s = (x s ,y s ), and z t = (x t ,y t ) 
denote the coordinates of nodes u, v, s, and t, respectively. 
Let S(X) and S(X\Y) for random variables X and Y denote 
the support of the probability density function f(x) and the 
support of the conditional probability density function f(x\y), 
respectively. Let Ti C S(z u ,z v ) denote the set of (z u , z v ) 
such that the line through z u and z v intersects Cr, and let T 2 
denote S(z u ,z v ) \ Fx. Let 9 = Zuvs and let A(z u , z v , d av ) 
denote the length of S(6\z u , z v ,d sv ) where the length of an 
interval [a, b] is defined as b — ajij Let <fi = Zvsu and 
let <fii, (f>2, (j>3, and ^4 denote Zvsu when z s is fixed at 
(L, 0), (L + D, 0), (L + D, D), and (L, D), respectively. Let 
ItfJ =min{|^|, \<h\M \4>i\}. See Fig. g] 

Now we are ready to prove Lemma [6] E[Qi2i2] is upper- 
bounded as 

E[Ql212] 

= E[E[Q stuv \z u , z v ]] 

2 2?r 
= E[E [a su a sv cos( — (d su - d sv ))\z u , z v ] 



+ E 2 [a su a sv sin(— (d su - d 8V ))\z u , z v ] 



X 



2tt 



< E[E [a max | E[a su cos( — (d su - d sv ))\z u ,z v ,d 



X 



sv\ I **V\ 



,2tt 



+ E [a max | E[a stl sin( — (d su - d sl ,))|z„, d su ] \\z u , z v ]]. 

Furthermore, | E[a su cos(^-(d sll - d sv ))\z u , z v , d sv ]\ and 
I E[a su sm(^-(d su - d sv ))\z u , z v ,d sv }\ are upper-bounded as 

\E[a su cos(— (d su - d sv ))\z u ,z v ,d sv ]\, 

15 Here, we follow the convention that ABAC is the counterclockwise angle 
from B to C and \ZBAC\ < tt. 



2tt 

\E[a su sin(— (d su - d sv ))\z u , z v , d sv ]\ 



XD 



< j A(z„,z„,d„) v w^rc if(2 M ,^)eri ^ 

for some positive constants K' n and A^i, wnere Vuv — 
\Vu — Vv\- These upper bounds are derived at the end of this 
appendix. 

Using the above upper bounds, E[Q s t uv \z u , z v ) is upper- 
bounded separately for the cases of (z u ,z v ) G Ti and 

(z u ,z v ) G r 2 . If (z u ,z v ) e Ti, we have 



,2tt 



^ E [flmax I E [o-s-u COs( ^ {d su dgvf) \ %u j 2*u , G^^] 1 1 Z u , 2^] 

2 . 27T 

~t~ E [flmax | E [dsu SUl( — — {d su )) I ^-Li 3 0<y j C?sd] 1 1 -^u 5 ^-u] 



A 



< 2E' 



K[ ia 



I ^,2 

max 



XD 



< K 12 a max 



A{z U: z V: d sv ^ V yuvL 
XL 



VuvD 



for some positive constant K[ 2 . If (z u , z v ) € T2, we have 

E[Qs£'itt; j Z U , Zyj 

— E [flmax I E [dsu dgv^ \ ^u 5 -^u 5 ^su] 1 1 7 ^v] 

~f~ E [o-max I E \o<su sin( — — {^d su — d sr ,y^ | z u , , g? S d] 1 1 ^ u , Zx\ 
,2tt 



< a max E[|E[a SM cos( — {d su - d sv ))\z u , z v , d av ]\\z u , z v ] 

^21 a max-^ 



A 

a^ ax E[|E[a SM sin( — (d au - d sv ))\z u , z v , d sv ]\\z u , z v 



< 2a max E 



— A 22 "max 



d sw A (z„ , z v , d sv ) sin | (j) m \ 
X 



\Dsin|0 m | 

for some positive constant A'^- 

Because E[Q s tuv\z u , z v ] < a max , E[Q stul ,|z u , z v ] is upper- 
bounded as 



E[Qsi^t; \Zu: Zv 
< 



amax min< | 1, K 12^d] if {z Ul Z v ) ^ T X , 

111 ax 



C min j 1 , A 22 D sin A ! , | if (z u ,z.) € T 2 . 



L 22D s in|0 m | 

Finally, E[Q stul; ] is upper-bounded as follows: 

E[Q s t ul j] 

E[Qstuv 1-2-^5 ^v]f{z U: Zy)dz u dz v 

~\~ I E [QsfiiD j z u , z v ] y (z n , z v ~)dz u dz v 



/ min<l,A 12 T^ffi 

z v )dz u dz 

J ri I VuvD J 

A 



V; 



mini 1, A 22 



A> sin I 



f{%ui z v ^dz u dz v 
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< at 



/S(z„,z„) 



1) K i2——R \ f( z u, z v )dz u dz v 



< at 



4xax / min i !) K 22 n ■ I , I f /( Z «' 

•As(z u ,z„) I £>sin|0 m |J 
/ min j 1, i^ 2 — h j f(yuv)dy uv 



^23«max / min \ !, K 22 



s(\4>i\) 



A 



D sin |0i 



< a 



XL 



max K' 31 D 2 



1 + log 



AL 



for some positive constants K' 23 , K' 31 , and -K^- Since 



E[Q S 



Lemma [6] is proved. 



Now it remains to show the upper bounds in ( 1191 , The upper 
bounds in ( fT~9b are obtained by using the following lemma, 
whose proof is at the end of this appendix. 

Lemma 7: Let g(x) be a periodic Lebesgue-integrable 
function on K with period p > that satisfies <7(x) = 
— g(x + p/2) and max l6 i |<?(x)| = 1. Let h(x) be a non- 
negative and Lebesgue-integrable function on R. Consider an 
interval [a, 6] and constants ci ^ and c 2 . If a partition 
II = {xo, x\, x TO } of [a, b] exists for finite m such that 
a = xq < x\ < ... < x m = b and h(x) is monotone on each 
interval [xj_i,Xj] for i G [1 : to], we have 



from Lemma [7] where d su is such that 



d 3u + 4 



d\e[ 

dd x „ 



dd KU > 



d\e[ 



dd su 



for all d su G 5(d s „|z u , z v , d sv ). 
In the same way, 

,2tt 



| E[a su sin( — (d su - d sv ))\z u , z v , d sv ]\ 



< 



12a n 



d 3u + _ 



d|0| 



dd su . 



(22) 



+ 2 l!P~ddsu separately for the cases of 



We bound f- 

(z u ,z v ) € Ti and (z Ul z v ) G r^. Without loss of generality, 
assume that < x u . First, consider the case of (z u , z v ) G Ti. 
Note that is decreasing in d su & [d sv ~d uv , ^ d 2 v - dU 
and is increasing in d su G [y/d 2 sv - d 2 vl d sv + d uv \. For the 
case of ( 

d™, " ^L]> and hence, we have d SM = d sv -d uv . 
Therefore, we have 



d su + 4 



dd S n, 



dd* 



d\9\ 
dd su 



dd* 



\o\ 



d\e\ 



g{c\x + c 2 )h(x)dx 



< m 



h(x)dx 



where x G K is such that /. 2|<=l1 h(x)dx > f* + 2 '<=i h(x)dx 
for all x G \a, b — t^-tI. 

To obtain an upper bound on ( 1201 ) using Lemma [7J we first 
show that S(d su \z u , z v ,d sv ) consists of a finite number of 
intervals such that a su f(d su \z u , z v ,d sv ) is monotone for each. 
Because \6\ and d su have a one-to-one relationship, we have 



®Suf(d su | Z u , Z v , d su ) / ( | 9 | | ) Z v , (i su ) 



ddsu 



where 



gigj 

ddait. 



, sin |0| d s „ sin |0| 



We can easily show that a su 



ML 

dd B 



has at most two criti- 



cal points from its derivative with respect to |0| and that 
S(\6\\z u , z v ,d sv ) can be split into at most four intervals 
such that f(\9\\z u , z v ,d sv ) is a constant for each. Hence, 
S(\9\\z u , z v ,d sv ) can be split into at most six intervals such 
that a su f(\8\\z u , z v , d sv )-^p- is monotone for each, implying 
that S(d su \z u , z v , d sv ) can also be split into at most six 
intervals such that a su f(d S u\z u , z v , d sv ) is monotone for each. 
Because a su < a max and f(\6\\z u , z v ,d sv ) < 2 <; , we 
have 



A(z„ 



,d,vV 



E[a s 



< 



,2tt 



cos( ^ {d su d sv ) ) | z u ; z v , d sv \ | 



12a„ 



d BU + -, 



ddsu 



ddsu 



(21) 



where \6\ is |Zmus| when d su = d sv 



for given 



z u , z Vl d sv . We have the following bounds on cos \8\: 



1 - Kni 



XD 

VuvL 



< cos |(9| < 1 - K' n2 \6\' 



(23) 



for some positive constants K[ n and K' ll2 . The upper bound 
holds since \6\ -C 7r and the lower bound is obtained as 



cos \e\ 



I 2 



> 1 - 



Qidsvduv 

X 



-duv + i 



= 1 sin \lu\ 

Duv 

>l-K> ™- 

— 1 '"■111. T 

where uj is the angle between the line through z u and z v and 
the horizontal line crossing z u and (a) is because (z u ,z v ) G 
Ti. From d23l ), we have 



d4„ = i9i < iM 13 XD 



dd 



VuvL 



for some positive constant K' n3 . Using this bound in ( ETT i and 
(l22l . the upper bounds in (TT~9b for the case of (z M ,z„) G Ti 
are obtained. 

Now consider the case of (z u ,z v ) G T 2 . The following 
lemma gives a lower bound on \<f>\ for the case of (z u , z v ) G 
r 2 , whose proof is given at the end of the present appendix. 

Lemma 8: When (z ui z v ) G T 2 is given, \<f>\ is lower- 
bounded by | (j> m | . 
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2tt 

E[a su cos(— (d s „ - d sv ))\z U7 z v ,d sv }\ 



S{d su \ z u ,z v ,d sv ) 



2-7T 

cos(-^- (d su dsv))@ j 8uf\d'8u \%ui z>vi d S v)dd su 



(20) 



From the above lemma, 

d\0\ 1 



< 



sin |0| d sin 



and hence, 



d|g| 
dd su 



dd su < 



A 



2d,„ sin I 



Using this bound in (|2TT > and (1221 1. the upper bounds in (fj~9t 
for the case of z„) G are proved. ■ 
Proof of Lemma [7} It can be easily shown that for any 
interval [01,61] on which h(x) is monotonically increasing, 
we have 



g(c\x + C2)h{x)dx 



< 



In 



h(x)dx, (24) 



and for any interval [02,62] on which h(x) is monotonically 
decreasing, 



g(c.\x + C2)h{x)dx 



< 



«2 + : 



h(x)dx. (25) 



From d24l) and d25l ). Lemma [7] is directly obtained. ■ 
Proof of Lemma [3} Assume that e T2 is given. 

Then, S((f>\z u , z v ) is included in either [— 7r,0) or (0,7r]. \<fi\ 
is given as follows: 



\4>\ 



arccos 



Fix x,. The derivative of 



*2d su d S v 



with respect to y s has 



the form of a rational polynomial , where <?2(2/s) is 

positive for every y s € [0,-D] and gi(y s ) is a cubic function 
of y s with a positive cubic coefficient whose roots are given 



Since = when y s 



x a {y u -yv)-x l ,y u +x u y l 



, which 



violates the assumption (z u ,z v ) <E T2, 5(y s ) contains at most 
one root of gi(y s ). Because the cubic coefficient of gi(y s ) is 

4-d 2 —d? 

positive, su 2d 3 ^ — — is maximized when y s is or D, and 
hence, |</>| is minimized when y s is or D. 

In a similar way, we can show that |0| is minimized when 
x s is L or L + D for fixed y s . Thus, |0| is lower bounded by 
\<f>m\- ■ 

Appendix D 

Extension to a Path-loss Exponent Larger than 

Two 

In this appendix, we consider the channel model with a path- 
loss exponent larger than two, i.e., the discrete-time baseband- 
equivalent channel gain (0) between nodes k and i at time m 
is changed to 



H ik [m] 



( .2vr , . A 
^-^exp^-^Hj (27) 



with the path-loss exponent a > 2. For a = 4, this channel 
model approximates the channel when there are a direct path 
and a reflected path off the ground plane between transmit and 
receive antennas with a sufficiently large horizontal distance. 
For a > 2 and a ^ 4, however, the channel model d27| i is 
not a direct consequence of Maxwell's equations, and hence, 
the DoF limit characterized in [2l is not valid for this channel 
model. 

Now, let us present throughput scalings using the modified 
HC scheme constructed in Section |IV] for the channel model 
in (1271) . In dense networks, we can get the same throughput 
scaling in Theorem Q] In extended networks, the throughput 
scaling using the modified HC scheme is decreased because 
the network becomes power-limited. For the same transmit 
power, the received power at each node in extended networks 
is decreased by a factor of n"/ 2 " 1 as compared to the dense 
network. By rescaling the space, an extended network can 
be considered as an equivalent dense network on a unit area 
but with the average power constraint per node reduced to 
P/n Q / 2_1 instead of P and the wavelength reduced to An" 1 / 2 
instead of A. Note that the average power constraint P/tW 2 " 1 
per node is less than P. As the bursty modification of the HC 
scheme in [4], we use the bursty version of the modified HC 
scheme, i.e., we use the modified HC scheme with operating 
power P for n 1_Q / 2 fraction of the time and keep silent for 
the remaining fraction of the time. This satisfies the average 
power constraint per node P/n a ^ 2 ^ 1 and yields an aggregate 
throughput scaling of n 1_ 7 min {-^/riA -1 , n} c . 
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